Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

A heuristic approach to author name disambiguation in bibliometrics databases for large‐scale research assessments

Identifieur interne : 000519 ( Main/Exploration ); précédent : 000518; suivant : 000520

A heuristic approach to author name disambiguation in bibliometrics databases for large‐scale research assessments

Auteurs : Ciriaco Andrea D'Angelo [Italie] ; Cristiano Giuffrida [Pays-Bas] ; Giovanni Abramo [Italie]

Source :

RBID : ISTEX:619AE2C1CD7F26BAA84A1F70343E7D9A59954D49

Abstract

National exercises for the evaluation of research activity by universities are becoming regular practice in ever more countries. These exercises have mainly been conducted through the application of peer‐review methods. Bibliometrics has not been able to offer a valid large‐scale alternative because of almost overwhelming difficulties in identifying the true author of each publication. We will address this problem by presenting a heuristic approach to author name disambiguation in bibliometric datasets for large‐scale research assessments. The application proposed concerns the Italian university system, comprising 80 universities and a research staff of over 60,000 scientists. The key advantage of the proposed approach is the ease of implementation. The algorithms are of practical application and have considerably better scalability and expandability properties than state‐of‐the‐art unsupervised approaches. Moreover, the performance in terms of precision and recall, which can be further improved, seems thoroughly adequate for the typical needs of large‐scale bibliometric research assessments.

Url:
DOI: 10.1002/asi.21460


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A heuristic approach to author name disambiguation in bibliometrics databases for large‐scale research assessments</title>
<author>
<name sortKey="D Angelo, Ciriaco Andrea" sort="D Angelo, Ciriaco Andrea" uniqKey="D Angelo C" first="Ciriaco Andrea" last="D'Angelo">Ciriaco Andrea D'Angelo</name>
</author>
<author>
<name sortKey="Giuffrida, Cristiano" sort="Giuffrida, Cristiano" uniqKey="Giuffrida C" first="Cristiano" last="Giuffrida">Cristiano Giuffrida</name>
</author>
<author>
<name sortKey="Abramo, Giovanni" sort="Abramo, Giovanni" uniqKey="Abramo G" first="Giovanni" last="Abramo">Giovanni Abramo</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:619AE2C1CD7F26BAA84A1F70343E7D9A59954D49</idno>
<date when="2011" year="2011">2011</date>
<idno type="doi">10.1002/asi.21460</idno>
<idno type="url">https://api.istex.fr/document/619AE2C1CD7F26BAA84A1F70343E7D9A59954D49/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000B51</idno>
<idno type="wicri:Area/Istex/Curation">000B36</idno>
<idno type="wicri:Area/Istex/Checkpoint">000176</idno>
<idno type="wicri:doubleKey">1532-2882:2011:D Angelo C:a:heuristic:approach</idno>
<idno type="wicri:Area/Main/Merge">000525</idno>
<idno type="wicri:Area/Main/Curation">000519</idno>
<idno type="wicri:Area/Main/Exploration">000519</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">A heuristic approach to author name disambiguation in bibliometrics databases for large‐scale research assessments</title>
<author>
<name sortKey="D Angelo, Ciriaco Andrea" sort="D Angelo, Ciriaco Andrea" uniqKey="D Angelo C" first="Ciriaco Andrea" last="D'Angelo">Ciriaco Andrea D'Angelo</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Laboratory for Studies of Research and Technology Transfer at University of Rome “Tor Vergata,” Via del Politecnico 1, 00133 Rome</wicri:regionArea>
<placeName>
<settlement type="city">Rome</settlement>
<region nuts="2">Latium</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Giuffrida, Cristiano" sort="Giuffrida, Cristiano" uniqKey="Giuffrida C" first="Cristiano" last="Giuffrida">Cristiano Giuffrida</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Department of Computer Science, Vrije Universiteit, De Boelelaan 1081A, 1081 HV Amsterdam</wicri:regionArea>
<wicri:noRegion>1081 HV Amsterdam</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Pays-Bas</country>
</affiliation>
</author>
<author>
<name sortKey="Abramo, Giovanni" sort="Abramo, Giovanni" uniqKey="Abramo G" first="Giovanni" last="Abramo">Giovanni Abramo</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Italie</country>
<wicri:regionArea>National Research Council of Italy and Laboratory for Studies of Research and Technology Transfer at University of Rome “Tor Vergata,” Dipartimento di Ingegneria dell'Impresa, Universitàdegli Studi di Roma “Tor Vergata,” Via del Politecnico 1, 00133 Rome</wicri:regionArea>
<placeName>
<settlement type="city">Rome</settlement>
<region nuts="2">Latium</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Italie</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Journal of the American Society for Information Science and Technology</title>
<title level="j" type="abbrev">J. Am. Soc. Inf. Sci.</title>
<idno type="ISSN">1532-2882</idno>
<idno type="eISSN">1532-2890</idno>
<imprint>
<publisher>Wiley Subscription Services, Inc., A Wiley Company</publisher>
<pubPlace>Hoboken</pubPlace>
<date type="published" when="2011-02">2011-02</date>
<biblScope unit="volume">62</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="257">257</biblScope>
<biblScope unit="page" to="269">269</biblScope>
</imprint>
<idno type="ISSN">1532-2882</idno>
</series>
<idno type="istex">619AE2C1CD7F26BAA84A1F70343E7D9A59954D49</idno>
<idno type="DOI">10.1002/asi.21460</idno>
<idno type="ArticleID">ASI21460</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">1532-2882</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">National exercises for the evaluation of research activity by universities are becoming regular practice in ever more countries. These exercises have mainly been conducted through the application of peer‐review methods. Bibliometrics has not been able to offer a valid large‐scale alternative because of almost overwhelming difficulties in identifying the true author of each publication. We will address this problem by presenting a heuristic approach to author name disambiguation in bibliometric datasets for large‐scale research assessments. The application proposed concerns the Italian university system, comprising 80 universities and a research staff of over 60,000 scientists. The key advantage of the proposed approach is the ease of implementation. The algorithms are of practical application and have considerably better scalability and expandability properties than state‐of‐the‐art unsupervised approaches. Moreover, the performance in terms of precision and recall, which can be further improved, seems thoroughly adequate for the typical needs of large‐scale bibliometric research assessments.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Italie</li>
<li>Pays-Bas</li>
</country>
<region>
<li>Latium</li>
</region>
<settlement>
<li>Rome</li>
</settlement>
</list>
<tree>
<country name="Italie">
<region name="Latium">
<name sortKey="D Angelo, Ciriaco Andrea" sort="D Angelo, Ciriaco Andrea" uniqKey="D Angelo C" first="Ciriaco Andrea" last="D'Angelo">Ciriaco Andrea D'Angelo</name>
</region>
<name sortKey="Abramo, Giovanni" sort="Abramo, Giovanni" uniqKey="Abramo G" first="Giovanni" last="Abramo">Giovanni Abramo</name>
<name sortKey="Abramo, Giovanni" sort="Abramo, Giovanni" uniqKey="Abramo G" first="Giovanni" last="Abramo">Giovanni Abramo</name>
<name sortKey="D Angelo, Ciriaco Andrea" sort="D Angelo, Ciriaco Andrea" uniqKey="D Angelo C" first="Ciriaco Andrea" last="D'Angelo">Ciriaco Andrea D'Angelo</name>
</country>
<country name="Pays-Bas">
<noRegion>
<name sortKey="Giuffrida, Cristiano" sort="Giuffrida, Cristiano" uniqKey="Giuffrida C" first="Cristiano" last="Giuffrida">Cristiano Giuffrida</name>
</noRegion>
<name sortKey="Giuffrida, Cristiano" sort="Giuffrida, Cristiano" uniqKey="Giuffrida C" first="Cristiano" last="Giuffrida">Cristiano Giuffrida</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000519 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000519 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:619AE2C1CD7F26BAA84A1F70343E7D9A59954D49
   |texte=   A heuristic approach to author name disambiguation in bibliometrics databases for large‐scale research assessments
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024